Research on Speech Synthesis Based on Mixture Alignment Mechanism
نویسندگان
چکیده
In recent years, deep learning-based speech synthesis has attracted a lot of attention from the machine learning and communities. this paper, we propose Mixture-TTS, non-autoregressive model based on mixture alignment mechanism. Mixture-TTS aims to optimize information between text sequences mel-spectrogram. uses linguistic encoder soft phoneme-level hard word-level approaches, which explicitly extract semantic information, introduce pitch energy predictors optimally predict rhythmic audio. Specifically, introduces post-net five-layer 1D convolution network reconfiguration capability We connect output decoder through residual network. The mel-spectrogram is converted into final audio by HiFi-GAN vocoder. evaluate performance AISHELL3 LJSpeech datasets. Experimental results show that somewhat better in mel-spectrogram, able achieve high-quality ablation studies demonstrate structure effective.
منابع مشابه
mortality forecasting based on lee-carter model
over the past decades a number of approaches have been applied for forecasting mortality. in 1992, a new method for long-run forecast of the level and age pattern of mortality was published by lee and carter. this method was welcomed by many authors so it was extended through a wider class of generalized, parametric and nonlinear model. this model represents one of the most influential recent d...
15 صفحه اولthe effects of speech rate,prosodic features, and blurred speech on iranian efl learners listening comprehension
کلید واژه ها به زبان انگلیسی: effect of speech rate on listening comprehension, blurred speech,segmental and suprasegmental features,authentic speech,intelligibility, discrimination, omission, assimilation چکیده: سرعت مطالب شنیداری در کلام پیوسته بطور کلی همواره کابوسی بوده برای یادگیرنده های زبان دوم و بالاخص برای شنوندگان ایرانی. علی رغم عقل سلیم که کلام با سرعت کندتری فعالیتهای درک مطلب شن...
15 صفحه اولconstructing gender identity through narratives based on hallidays metafunctions
هویت, شکل دادن و بازنمایی آن در گفتمان, توجه بسیاری از محققان این رشته را به خود جلب کرده است. تحقیق حاضر بر شکل دادن به هویت جنسیتی هشت تن از دانشجویان ایرانی مشغول به تحصیل در دوره کارشناسی ارشد از طریق بررسی روایات آنان از تجربیات شخصی, متمرکز شده است. تحلیل داده ها در این تحقیق مشتمل بر سه بخش است: بخش اول شامل کدگذاری موضوعی روایات است که بر اساس آن هویت جنسیتی شرکت کنندگان در تحقیق بر اسا...
15 صفحه اولA Research on Mixture Splitting for CHMM Based on DBC
EM (expectation-maximization) algorithm is a classical method for parameter estimation of HMM (Hidden Markov model). Concerning that EM algorithm is easily affected by initial parameter values, a mixture splitting algorithm based on decision boundary confusion(DBC) was proposed to describe more about boundary distribution. The algorithm mainly includes four aspects: firstly the number of increm...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Sensors
سال: 2023
ISSN: ['1424-8220']
DOI: https://doi.org/10.3390/s23167283